
Draft of Keras pickle RFC #1

Merged: 26 commits from keras-pickle-edits into keras-pickle on Sep 21, 2020

Conversation

@stsievert (Collaborator)

What does this PR implement?
A draft of the Keras pickle edits. I'm submitting this PR because I want to discuss these changes without adding a bunch of unnecessary comments/notifications/reviews that the TF maintainers have to sift through.

This PR should be merged when the first draft is done.

Reference issues/PRs
This is a draft for tensorflow#286

@adriangb (Owner) commented Sep 2, 2020

I'm sorry, I only just saw this and in the meantime made edits to the original. I'll try to incorporate your version into mine.

@adriangb (Owner) commented Sep 2, 2020

Sorry for rewriting the history, @stsievert. I incorporated what you wrote into what I had. We can continue working here until we have something more final that we can upstream.

@adriangb (Owner) commented Sep 3, 2020

I think I'd also like to include a monkey-patched example of using Model.save, with a string to transfer the model back and forth, just as a POC.
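
Below is a rough sketch of what such a monkey-patched POC could look like (illustrative only, not the RFC's code; it round-trips through the single-file HDF5 format for simplicity, while the actual POC uses SavedModel, and the helper names here are made up):

import os
import tempfile

import tensorflow as tf


def _unpack_keras_model(packed_bytes):
    # Rebuild the Model from the bytes produced by Model.save.
    with tempfile.TemporaryDirectory() as tmp:
        path = os.path.join(tmp, "model.h5")
        with open(path, "wb") as f:
            f.write(packed_bytes)
        return tf.keras.models.load_model(path)


def _reduce_keras_model(self):
    # Serialize the Model to bytes via Model.save and return pickle instructions.
    with tempfile.TemporaryDirectory() as tmp:
        path = os.path.join(tmp, "model.h5")
        self.save(path)
        with open(path, "rb") as f:
            packed_bytes = f.read()
    return _unpack_keras_model, (packed_bytes,)


# Monkey patch: every Model (and subclass) now pickles through Model.save.
tf.keras.Model.__reduce__ = _reduce_keras_model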

@adriangb (Owner) commented Sep 3, 2020

A POC for using SavedModel as a backend for pickle is up in the notebook: https://colab.research.google.com/drive/1SEGXDFfNl5i0Cy8a4xvRjWOOjdq3sOOC?authuser=1#scrollTo=VZIkAZyJE1CP

@adriangb (Owner) commented Sep 3, 2020

Anything else you think we need to add @stsievert?

@stsievert (Collaborator, Author) commented Sep 3, 2020 via email

@adriangb (Owner) commented Sep 3, 2020

Okay, I'll let you do the next round of edits before I make any more changes.

@stsievert (Collaborator, Author)

I made some edits to the motivation and user benefits section. I'd still like to review the technical content.

@adriangb (Owner) commented Sep 3, 2020

Edits look great. And thanks for fixing the line lengths.

@stsievert (Collaborator, Author) commented Sep 3, 2020

Why implement __reduce_ex__? __reduce_ex__ only exists to support different pickle protocol versions (docs). AFAICT the protocol argument isn't used. Why not __reduce__?

I'd still like to make a couple more edits to this RFC. I'll try to make the edits later tomorrow. Feel free to make some edits; I revised the technical section.

@adriangb (Owner) commented Sep 3, 2020

Two main reasons:

  1. I recall running into issues using just __reduce__ when I first tried it. I think it was breaking some test that implemented it in subclassed models, but I don't remember, tbh. Might be worth trying again.
  2. I think it's likely that they will want to use the protocol argument in the future, e.g. to map it to different versions of the SavedModel format (see the sketch below).
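
A minimal sketch of the distinction (illustrative only, not code from the RFC):

class PickleableModel:
    def __reduce__(self):
        # No access to the pickle protocol number requested by the caller.
        return (self.__class__, ())

    def __reduce_ex__(self, protocol):
        # The protocol number is available here, so it could later be used
        # to pick, say, a different SavedModel serialization path.
        if protocol >= 5:
            return (self.__class__, ())  # hypothetical protocol-aware branch
        return super().__reduce_ex__(protocol)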

@stsievert (Collaborator, Author) commented Sep 4, 2020

@mrocklin we're proposing to add Pickle support to TensorFlow. Could you review our proposal? It's at 20200902-pickle-for-keras.md. I'd appreciate any of your thoughts, including ideas on other problems Pickle support would solve for Keras.

In this, I lifted some wording about ecosystem support from your post "Pickle isn't slow, it's a protocol." Here's the relevant paragraph:

Supporting Pickle will enable wider usage in the Python ecosystem because Python's ecosystem of libraries depends strongly on the presence of protocols. Without these protocols, each library has to implement a custom serialization method for every other library. For example, Dask Distributed has a custom serialization method for Keras at distributed/protocol/keras.py. See "Pickle isn't slow, it's a protocol" for more detail (notably, that post focuses on having an efficient Pickle implementation for PyTorch).

Let me know if I should change anything in that paragraph.

@mrocklin commented Sep 4, 2020

I'm happy to see this work. I'm a bit slammed at the moment, but maybe instead of reviewing I could try to conscript @jakirkham? He has been thinking a lot about serialization recently, and I think it would be good to have someone from the RAPIDS team engaged regardless.

@jakirkham

I would suggest leveraging pickle protocol 5 where possible to avoid copying frames before sending them over the wire. This is something Dask (dask/distributed#3784) is able to leverage for zero-copy transmission.
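
For reference, a small illustration (not from the RFC) of protocol-5 out-of-band buffers with NumPy; the array data is handed to buffer_callback instead of being copied into the pickle stream:

import pickle

import numpy as np

arr = np.arange(1_000_000, dtype="float64")

buffers = []
payload = pickle.dumps(arr, protocol=5, buffer_callback=buffers.append)

# buffers now holds PickleBuffer objects that can be sent zero-copy;
# supply them back to pickle.loads to reconstruct the array.
restored = pickle.loads(payload, buffers=buffers)
assert np.array_equal(arr, restored)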

@jakirkham

Yep, NumPy also supports the pickle5 backport package, so if that's available, it will work on Python 3.5+.

Exactly, this is why we do that as well. NumPy is a common dependency, so most people already have it.
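
A common fallback pattern (a sketch, assuming the pickle5 backport is installed on older Pythons):

try:
    import pickle5 as pickle  # backport providing protocol 5 on older Python versions
except ImportError:
    import pickle  # Python 3.8+ has protocol 5 built in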

@jakirkham

cc @quasiben (for vis)

@adriangb (Owner)

Hi folks, let's make a push to get this wrapped up this weekend so we can submit the PR in the TF repo sometime next week and start to get some feedback on that side. Please let me know if you see anything that would be a blocker to this.

@stsievert (Collaborator, Author)

Doesn't the implementation rely on NumPy's Pickle 5 support? So can't __reduce_ex__ be replaced with __reduce__?

I've deleted the metric tests. It's a minimal implementation. I think it's obvious that the tests will pass with NewMetric.

from tensorflow.keras.models import load_model

class Model:
...
class NewModel(Model):

@adriangb (Owner) commented on the snippet above:

I'm curious as to why the name change? I was envisioning these code blocks as representing "here's pseudocode of what this would look like if implemented in TF" and not necessarily "here's how users can create a picklable Model", which is the first thought that came to mind when I saw NewModel(Model).

@adriangb (Owner) commented Sep 19, 2020

Doesn't the implementation rely on NumPy's Pickle 5 support? So can't __reduce_ex__ be replaced with __reduce__?

I was thinking that, but I wasn't 100% sure. I also had the thought that (1) this is a private ecosystem method, not something users will really be looking at, so how nice the name looks doesn't really matter, and (2) __reduce_ex__ will allow more flexibility if anyone wants to use the pickle protocol version within the method in the future.

I've deleted the metric tests. It's a minimal implementation. I think it's obvious that the tests will pass with NewMetric.

👍

@stsievert (Collaborator, Author)

(1) this is a private ecosystem method, not something users will really be looking at, so how nice the name looks doesn't really matter

I'm concerned with maintenance cost. Will some developer waste an hour looking into the difference between __reduce__ and __reduce_ex__ because the protocol argument isn't used?

(2) __reduce_ex__ will allow more flexibility if anyone wants to use the pickle protocol version within the method in the future.

I don't think "more flexibility" is relevant; I think __reduce__ is as flexible as __reduce_ex__ because the protocol argument isn't used in this implementation.

@adriangb (Owner)

I do see your point, but I think there are arguments either way. If you feel strongly about it and are 100% sure that we can use __reduce__ and still get the full benefits of protocol 5 support because we offload to NumPy, then by all means switch it back; I don't intend to hold this back over something that minor.

@adriangb (Owner)

I think we are pretty much good to go.

@stsievert, do you want me to change __reduce_ex__ back to __reduce__ before submitting this?

@stsievert (Collaborator, Author)

change __reduce_ex__ back to __reduce__

I think @jakirkham is the right person to ask.

I'm pretty sure zero-copy transmission remains possible with __reduce__, but I'm not absolutely certain. PEP 574 seems to indicate that it is:

We see that several conditions are required for the above to work:

  • __reduce__ or __reduce_ex__ must be able to return something that indicates a serializable no-copy buffer view.
  • ...
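
As a quick sanity check of that reading (illustrative only; the class and attribute names are made up): an object whose __reduce__ simply hands its NumPy weights back to pickle still gets out-of-band buffers, because NumPy itself implements protocol 5.

import pickle

import numpy as np


class WeightsWrapper:
    def __init__(self, weights):
        self.weights = weights

    def __reduce__(self):
        # Return the class and constructor arguments; pickle then serializes
        # the NumPy array itself, which supports protocol-5 buffers.
        return (WeightsWrapper, (self.weights,))


buffers = []
obj = WeightsWrapper(np.zeros(100_000))
payload = pickle.dumps(obj, protocol=5, buffer_callback=buffers.append)
# buffers is non-empty: the array data was exported out-of-band, not copied inline.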

@adriangb (Owner)

Thank you, I'll wait on their response and then send this out.

@jakirkham

Yeah it shouldn't matter if we are just using NumPy to handle out-of-band pickling.

@adriangb (Owner)

Thank you all for your contributions. I'm going to merge this and move this to the TF repo.

@adriangb merged commit 62d98dc into keras-pickle on Sep 21, 2020
@adriangb deleted the keras-pickle-edits branch on Sep 21, 2020 at 18:55
@adriangb (Owner)

@stsievert can you sign the Google CLA to fix tensorflow#286 (comment) if you want to keep authorship of your commits?

@stsievert (Collaborator, Author)

I don't mind losing Git authorship. Feel free to remove it.
